Sentence-Internal Prosody Does not Help Parsing the Way Punctuation Does
نویسندگان
چکیده
This paper investigates the usefulness of sentence-internal prosodic cues in syntactic parsing of transcribed speech. Intuitively, prosodic cues would seem to provide much the same information in speech as punctuation does in text, so we tried to incorporate them into our parser in much the same way as punctuation is. We compared the accuracy of a statistical parser on the LDC Switchboard treebank corpus of transcribed sentence-segmented speech using various combinations of punctuation and sentence-internal prosodic information (duration, pausing, and f0 cues). With no prosodic or punctuation information the parser’s accuracy (as measured by F-score) is 86.9%, and adding punctuation increases its F-score to 88.2%. However, all of the ways we have tried of adding prosodic information decrease the parser’s F-score to between 84.8% to 86.8%, depending on exactly which prosodic information is added. This suggests that for sentence-internal prosodic information to improve speech transcript parsing, either different prosodic cues will have to used or they will have be exploited in the parser in a way different to that used currently.
منابع مشابه
Psycholinguistics Cannot Escape Prosody
Once, sentence processing research set aside prosody in order to focus on syntactic and semantic processing. Experimental sentences were mostly presented visually, often without prosodic markers such as commas. Now that we have made some progress by this ‘divide and conquer’ approach, and now that the technology for working on speech has improved, it may be time to integrate prosody into proces...
متن کاملPunctuated Parsing: Signposts Along the Garden-Path
Although there has been some speculation concerning the role played by punctuation in parsing, there has been amazingly little empirical investigation of the issue. Punctuation appears to be a widely neglected topic. For the most part, where punctuation has been included in parsing studies, investigators have simply assumed that punctuation, such as commas, can be used to effectively disambigua...
متن کاملThree Dependency-and-Boundary Models for Grammar Induction
We present a new family of models for unsupervised parsing, Dependency and Boundary models, that use cues at constituent boundaries to inform head-outward dependency tree generation. We build on three intuitions that are explicit in phrase-structure grammars but only implicit in standard dependency formulations: (i) Distributions of words that occur at sentence boundaries — such as English dete...
متن کاملCommas and Spaces: The Point of Punctuation
While it has been widely assumed that punctuation may play a critical role in parsing, there has been relatively little direct empirical investigation of its effects. Most researchers have either avoided the use of punctuation or have simply assumed that it will serve a disambiguating role. There has been little or no consideration of how ’disambiguation’ might occur or whether it is equally ef...
متن کاملTowards a Syntactic Account of Punctuation
Little notice has been taken of punctuation in the field of natural language processing, chiefly due to the lack of any coherent theory on which to base implementations. Some work has been carried out concerning punctuation and parsing, but much of it seems to have been rather ad-hoc and performance-motivated. This paper describes the first step towards the construction of a theoretically-motiv...
متن کامل